Sequence randomness and polymer collapse transitions. 
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Contrary to expectations based on Harris' criterion, chain disorder with frustration can modify 
the universality class of scaling at the theta transition of heteropolymers. This is shown for a model 
with random two-body potentials in 2D on the basis of exact enumeration and accurate Monte 
Carlo results. When frustration grows beyond a certain finite threshold, the temperature below 
which disorder becomes relevant coincides with the theta one and scaling exponents definitely start 
deviating from those valid for homopolymers. 
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In recent years, work on polyampholytes and on 
biomolecules, like proteins, has focused much attention 
on the conformational properties of inhomogeneous poly- 
meric chains (heteropolymers) JjJ] . Some of these proper- 
ties, like protein folding, are expected to be determined 
by the specific sequence of different monomers constitut- 
ing the chain In this context, particularly relevant is a 
statistical point of view, according to which the behavior 
of large ensembles of different sequences is globally tested 
with philosophy and methods of the physics of quenched 
disordered systems || . Such studies are intended to pro- 
vide information on the conditions under which speci- 
ficity becomes important (i.e., disorder becomes relevant) 
and to give global descriptions of its possible effects. 

The collapse from swollen to compact globular state 
of a long macromolecular chain is by now well under- 
stood in the case of homopolymers. At temperature T, 
a chain with N » I equal monomers has an average 
radius of gyration Rn oc N u ( t > . v{T) is equal to the 
exponent of a self-repelling chain (SAW) Q for all tem- 
peratures T > Tg, and to l/d for T < Tg, as appro- 
priate for a compact object. v{Tg) = vg has a distinct, 
intermediate value, known both in 3D (1/2) and in 
2D (4/7) 0. This transition is triggered by attractive 
interactions between monomers as T decreases. In the 
case of heteropolymers, the theta transition involves the 
same sequence of regimes and preludes to the folding 
phenomenon, for which specificity is surely of key im- 
portance. Thus, one can wonder if chain disorder could 
substantially affect polymer behavior already at the on- 
set of theta collapse, or even at higher T. 

Unlike random environment disorder for a homopoly- 
mer, if amounting to a small perturbation, inhomo- 
geneities in the structure of the chain should in general 
be expected to be irrelevant and not able to affect the 
ordered system's behavior, as far as universal scaling is 
concerned. This is suggested by Harris' criterion |?J as we 
discuss below. In fact, contrary to previous conjectures 
H , recent work on a model of randomly charged polymers 
in 2D and 3-D, has shown that disorder does not change 



the universality class of the theta transition M , consis- 
tent with the most simple scenario one could infer based 
on application of the above criterion. However, in this 
Letter we give evidence that sufficiently large amounts of 
chain disorder and frustration can modify the heteropoly- 
mer theta behavior, with respect to the homopolymer 
one. Thus, in such situations, chain specificity becomes 
a key ingredient in determining the universality class of 
the theta point. Quite remarkably, this point seems to 
fall right at the upper limit of temperatures for which 
disorder plays a relevant role. 

As heteropolymer model we consider here an TV-step 
SAW on square lattice. On each lattice site visited by 
the walk sits a monomer. Monomers i and j (0 < 
hj < N) which are not consecutive along the chain 
(* 3i \j ~ A 7^ 1) an d occupy nearest neighbor lat- 
tice sites in a given configuration, feel an attractive 
potential Vij ( i and j constitute a contact indicated 
by (ij))- Vij is random with probability distribution 
P(Vij) = P 5(V ij +V) + (l-p)S(V ij -V), (V < 0). Its val- 
ues are assigned independently to each pair of monomers 
along the sequence. Models of this kind were already used 
for proteins Q. We choose it here because, since disorder 
is associated to monomer pairs, rather than to individual 
monomers, the annealed partition can be easily mapped 
into a well defined effective homopolymer problem. 

One has to compute free energy and other thermal 
averages for each possible potential arrangement along 
the sequence. Results have then to be further averaged 
over disorder. In addition one wants to establish how 
important fluctuations due to different disorder config- 
urations are in the evaluation of the final results. A 
typical case is that of the free energy: the quenched 
quantity is the disorder average of the logarithm of the 
partition function, Z N {V} = E w exp(- E fo > ew Vy/T), 
where the number of steps of all the chain configura- 
tions u) is implicitly assumed equal to N. If the distri- 
bution of Zn values is sharply peaked around its dis- 
order average, = S/^i.n/y\P(Vij)^jv{V r }, we have 
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In(Zjy) — Iu{Zm) for N — » oo, and disorder plays no 
role, i.e. annealed and quenched free energies, respec- 
tively, are identical. 

Since the potentials for different (ij) are independent 
random variables, the annealed problem reduces to a 
standard homopol ymer one with an effective attractive 
interaction — Tlog(ea;p(— Vij/T). Imagine now to per- 
turb with a slight disorder (p ~ 0) the attractive ho- 



mopolymer situation (p = 1, Vu 



-V). The possible 



relevance of disorder can then be discussed by looking at 
n replicas of the SAW, for which the average partition 
can be put in the form: 
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where a cumulant expansion has been used for disor- 
der averages, and Vtj = V(l — 2p), while SVijSVki — 
4p(l — p)Sij5j t k- Thus, if Vij/T is fixed to the value ap- 
propriate for the renormalization group fixed point of a 
homopolymer at the theta transition, according to eq.l 
the leading perturbation to this fixed point is given by an 
operator proportional to the number I(w 7 , o> ) of distinct 
contacts common to two replicas in configurations w 7 and 
u>s [f[o| • The relevance or irrelevance of the disorder per- 
turbation depends on whether, for the two replicas, the 
average of / grows with N, or not. By exact enumeration 
we studied this average for two replicas of up to 19 steps 
without mutual interactions, and verified to high preci- 
sion that indeed, at the theta point, or even at lower 
temperatures, it saturates to a constant for growing TV. 
This implies irrelevance and could lead to expect that 
also finite amounts of disorder would not be sufficient to 
subtract the theta transition from the control of the ho- 
mopolymer fixed point. In such a scenario the border line 
temperature, Td, below which disorder becomes possibly 
relevant, should always satisfy Td < Tg strictly. 

To determine Td is highly nontrivial. A straightfor- 
ward strategy could consist in extrapolating the ratio 

Z%/Zn to N — > oo. According to general theorems 
prf, if this ratio tends to some finite B > 1, the an- 
nealed free energy should be obtained for a fraction of 
all sequences summing up to a probability > l/B. Thus, 
if B = 1, quenched and annealed problems must coin- 
cide. B is in principle adequate only to establish an 
upper bound on Td- In addition, since quantities like 
Zm are wildly diverging and sensibly oscillating for in- 
creasing N , B estimates are problematic and we have 
to use a different strategy 0. Besides v, also entropic 
exponents characterize SAW scaling. For example, one 
expects ~ N^- 1 I<- N for N ->• oo @. 7 a must 
take on the distinct values (appropriate for homopoly- 
mers) 43/32 and 8/7, for T > T a g and T = T aB §,§, 



respectively, where T a g indicates the theta temperature 
of the annealed problem. For T < T a g the precise value of 
7 a is still debated and may be non-universal, depending 
on lattice and boundary conditions fl2|| . If the polymer 
has one end fixed on an impenetrable boundary in semi- 
infinite geometry, the behavior of the corresponding par- 
tition function, Z\n, changes only to the extent that j a is 
replaced by a boundary exponent 7i a , while the exponen- 
tial growth has the same K a [|Jl3|]. Thus, Zn/Zin grows 
as jV 7a ~~ 7l °, which is more easy to extrapolate (TJ]. If one 
assumes for the quenched free energies ea;p(log(Z/v)) and 
exp(log(Zijf)) similar behaviors with exponents 7 and 
71, respectively, 7 — 71 = 7a — 7ia should hold for ex- 
trapolated differences, as long as annealed and quenched 
free energies coincide. The temperature at which the two 
differences possibly cease to be equal should be identified 
with Td- If T a g > Td, one must also find T a g = Tg and 
all theta exponents in the annealed and quenched prob- 
lems are the same. Td determinations based on checking 
validity of 7 — 71 = 7 a — 71 a are much more precise than 
those based on the analysis of exponential growths, and 
reveal of key importance for our investigation. 
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FIG. 1. Extrapolated ug as a function of p. For details of these 
and other determinations see ref. |sjj. 



The basic tool of our approach is the exact determina- 
tion of all possible contact maps p5[ for a polymer of TV 
steps. The contact map in a given configuration u> is the 
set of contacts (ij) £ 10, After all maps have been sorted, 
since interactions pertain to contacts, the (entropic) free 
energy associated to each map can be evaluated exactly 
once for all, and used in order to perform relatively fast 
averages over the disorder affecting the interactions. This 
latter averaging could be performed either exactly, or by 
extensive Monte Carlo (MC) sampling ( up to 2 x 10 4 
potential configurations) for the longest chains consid- 
ered here. Our contact map algorithm alone can treat 
chains of length exceeding by at least 4 steps the maxi- 
mum length reached in most recent work p6| . 
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We determined for N up to 22 and all T the averages 



(R%) = s {y} n fo> p(%)(^){y} 



(2) 



where {R%){V} is the thermally averaged end-to-end dis- 
tance for a particular {V}. (R 2 N ) is expected to grow with 
N — > oo as a power law with the three exponent values 
mentioned above. We expect that effective exponents 
v{M,K,T) = | log((^)/( J R^_ if ))/ log (M/(M - K)), 
should interpolate smoothly between values, which, for 
increasing M, approach the swollen and the compact v 
exponents, for T > Tg and T < Tg, respectively. If the 
trends of approach are monotonic and opposite in the 
two cases, it is also very likely to find that the various 
curves v — v{M, K, T) bend quite rapidly and intersect 
with each other in a narrow region of the (T, v) plane, 
each intersection representing an approximate determi- 
nation of the asymptotic vg. In fact the pattern of In- 
dependence we found for v(M, K, T) is of this kind for 
all values of p we tested. We identified Tg as the center of 
the relatively narrow T range within which the trend of 
various v(M, K,T) changes from monotonically increas- 
ing, to monotonically decreasing with M. This can be 
done by use of suitable data correlators ||. vg is then 
determined by extrapolation of {R%) at Tg. Figs.[l] and 
U show determinations of vg and Tg, respectively. T a g is 
known with high precision based on the mapping of the 
annealed problem onto the effective homopolymer one ( 
Fig.^j). Tg remains remarkably close to T a g in the whole 
range < p < 1. This means that Td should never over- 
come Tg, even for very large p's. That Td > Tg should 
be ruled out is rather plausible, since sequence disorder 
is very unlikely to affect the swollen phase. On the other 
hand, the behavior of vg as a function of p is pretty stable 
and nicely consistent with the homopolymer universality 
class (vg ~ 4/7) only for p < 0.50 -j- 0.55. For larger p's 
vg starts deviating rather markedly (yg ~ 0.64) from 4/7 
and increases up to ~ 0.68 for p close to 1 . Even if un- 
certainties do not allow to identify precisely a different 
plateau for p ^> 0.55, the deviation from the homopoly- 
mer theta point universality class is very clear. This 
evidence is enforced by further results for the crossover 
exponent cj>g defined by ^~(R^)\ T=Te ~ jV^ 9+2ve and 
extrapolated from our determinations of this derivative 
at Tg. The pattern is qualitatively similar to that of 
Fig.|l|. A plateau (<p ~ 0.45) slightly above the exact 
homopolymer (f)g = 3/7 || is seen also in this case for 
p < 0.50 -7- 0.60, while for p ;> 0.60 again clear increasing 
deviations from this value occur. 

The above deviations for p p> 0.50, combined with 
Tg ~ T a g, suggest that, in the upper range of p, Td should 
be very close to, and possibly coincide with Tg. A strict 
coincidence is the most plausible way in which disorder 
could affect the universal theta point properties, while 
leaving Tg — T a g. Quite remarkably our determinations 
of Td based on extrapolations of 7 — 71 and 7 a — 7 al 
are pretty consistent with this conclusion. These expo- 
nents are plotted in Fig.|| for p — 0.6. Td corresponds 



to the splitting of the two curves, which occurs slightly 
below Tg. Althought uncertainties are not small, Td ~ Tg 
is clearly suggested by the overall pattern of determina- 
tions (FigJ|) for p p> 0.5, while for p <; 0.5 one definitely 
finds Td < Tg, consistent with the picture conjectured 
above. 
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FIG. 2. Temperatures playing a role in the problem. 



Frustration is essential in order to produce the above 
change of theta universality class: distributions P{Vij) 
with support restricted to attractive potentials did not 
show such modification. Thus, the change from low- to 
high-p theta regimes could have analogies with a transi- 
tion from ferromagnetic to spin glass ordering 

Summarizing, we gave strong evidence that a suffi- 
cient amount of frustrated sequence disorder can be rel- 
evant for the heteropolymer behavior in the whole range 
T < Tg, determining in particular a universality class 
different from that of homopolymcrs for the theta transi- 
tion. This is strongly suggested by the global consistency 
of our results for various exponents and temperatures of 
both quenched and annealed problems. The conjecture 
that Td = Tg in the high frustration regime, is rather 
natural and implies the intriguing possibility that, when 
frustration is high enough, specificity becomes a key fac- 
tor in the quenched statistics right from where the het- 
eropolymer starts collapsing. 

The limited accuracy and asymptoticity of our deter- 
minations do not allow a precise conjecture on the nature 
of the transition regime in the whole range 0.50 <^ p < I. 
T a e approaches very steeply for p — * 1~ {T a g(p) ~ 
— l/log(I — p)). Also Td should approach for p — > I~ 
since at all T on the p = 1 line the behavior of the sys- 
tem is that of a SAW, controlled by a T = 00 fixed point. 
The proximity of this line could be responsible for the 
increase of our v estimates [y ~ 0.68) for p closer to I. 
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basis of all data we could estimate ^(0.8) = 0.64 ± 0.01 
|l8| , which qualifies as our best exponent determination 
concerning the expected novel theta universality class at 
p J> 0.50. The value of this exponent is surprisingly very 
close to that appropriate for branched polymers in 2D 
& 

We expect that the new theta universality class could 
be found also in more realistic heteropolymer models. 
Polyampholytes with screened interactions and nonzero 
total charge [||J2^] are good candidates. In general the 
universality class at the point should change as soon as 
frustration exceeds a certain threshold. 
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FIG. 3. Example of determination for p = 0.6 
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FIG. 4. Exact (empty dots) and MC (full) data for p = 0.80. 

In order to rule out the possibility that our results 
could be explained just in terms of a very slow crossover 
to homopolymer theta behavior, due to the presence of 
the SAW line, we spent an exceptional effort in extending 
to larger N by MC methods our determinations of (Rjj) 



for the particular case p = 0.80. Experience [|3[|9|] has 
taught us that, besides the difficulty of thermal sampling 
at low T, which can, e.g., be solved by application of mul- 
tiple Markov chain algorithms jlTj], a major limitation of 
MC in this field is that quenched averages have to be car- 
ried out over very large ensembles of chain sequences in 
order to produce reliable results. This is even more com- 
pelling when chain specificity plays a relevant role. By 
extensive simulations based on a multiple Markov chain 
method |l7) (grids of up to 40 temperatures and ~ 600 
different sequences for each N) we obtained extra deter- 
minations of (i?^,) up to N = 64. The log-log plot of 
Fig.^| confirms very nicely the trend of the exact enumer- 
ation results and clearly excludes a crossover. On the 
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